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1 Introduction 

Suppose that a; is a physical quantity whose evolution is governed by a deter- 
ministic force which has small random fluctuations; such a phenomenon can be 
described by the following equation 

x = f(x) + h(x)$(t) (1) 
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where x = 4|, and £ is a "white noise" . Intuitively, a white noise is the derivative 
of a Brownian motion, namely a continuous function which is not differentiablc 
in any point. 

There is no function £ which has such a property, actually the mathematical 
object which models £ is a distribution. Thus equation {I} makes sense if it 
lives in the world of distributions. 

On the other hand the kind of problems which an applied mathematician 
asks are of the following type. Suppose that x(0) = and that £(£) is a random 
noise of which only the statistical properties are known. What is the probability 
distribution P(t, x) of x at the time tl 

This question can be formalized by the theory of stochastic differential equa- 
tion and eq. (P) takes the form 

dx = f(x)dt + h(x)dw. (2) 

thus, the white noise dw is regarded as the "differential" of a Wiener process 
w. In this case, both x(t) and w(t) are modelled, not by distributions, but by 
stochastic processes. 

By the Ito theory, the above question can be solved rigorously: the proba- 
bility distribution can be determined solving the Fokker-Plank equation: 

Eq. (TTJ) (or |(3J)) and §5§ are very relevant in applications of Mathematics 
and the practitioners of mathematics such as engineers, physicists, economists, 
etc. make a large use of it. However the mathematics used in these equations 
is rather involved and many of them are not able to control it. 

Usually people think of some intuitively simpler model. For example, £(£) 
is considered as a force which acts at discrete instants of time t , ; it is supposed 
that the difference of two successive times dt = ti+i — t% is infinitesimal and its 
strength is infinite; namely 

m = ±4tt w 



'dt 

The sign of this force is determined by a fair coin tossing. Clearly eq. ([4]) 
makes no sense and the gap between the rigorous mathematical description and 
the intuitive model is quite large. 

The main purpose of this paper is to reduce this gap. We will use infinite 
and infinitesimal numbers in such a way that eqs. (UJ) and (fT]) make sense and, in 
this framework, we will deduce eq.® rigorously. Our proof is relatively simple 
and very close to intuition. 

The use of infinite and infinitesimal numbers naturally leads to Nonstandard 
Analysis (NSA). Actually, some mathematicians have described the stochastic 
differential equation by Nonstandard Analysis (cf. e.g. [2], [S], pQ, [TT] and 
references therein) . However the machinery of NSA is too complicate for prac- 
titioners of mathematics even if its ideas are simpler. A good knowledge of 
formal logic is necessary to follow a NSA proof, in fact, the main tool is the 



transfer principle which, in order to be applied correctly, needs the notions of 
formal language and interpretation. 

In this paper we will not use NSA but a-theory which is a kind of simplified 
version of Nonstandard Analysis, a-theory has been introduced in [5] (see also 
[1] and [3]) with the purpose to provide a simpler approach to NSA. In fact, 
in the quoted paper it has been proved that a particular model of NSA can be 
deduced by the axioms of a-theory (we refer also to [6] and to [7] for the reader 
interested to investigate in these questions). 

The main differences between a-theory and the usual Nonstandard Analysis 
are two: 



• 



• 



a-theory does not need the language (and the knowledge) of symbolic 
logic; 

it does not need to distinguish two mathematical universes, (the standard 
universe and the nonstandard one). 



a-theory postulates the "existence" of an infinite integer number called a 
and it provides the rules necessary to deal with the mathematical objects which 
can be constructed by its introduction. For example, a-theory allows to define 
functions such as " sin (at) " and to manage with it. a-theory is not as powerful 
as NSA, but it is simpler and it allows to treat many problems by an elementary 
and rigorous use of infinite and infinitesimal numbers. 

In particular, using this theory, it is possible to define the "grid functions" 
which are functions defined for times U belonging to a set H which models the 
axis of time. Using the notion of grid function, we are able to give a sense to 
(Q} and ((4]) and to deduce eq. (J3]) rigorously. 

Our approach presents the following peculiarities: 

• we will rewrite eq. (JTJ) as a "grid" differential equation: 

^f = /(*)+£(*) ( 5 ) 

where ^| denotes the grid derivative (see Def. [TH]). From this equation, 
it easy to recover both a distribution equation and a stochastic equation, 
and, at the same time, eq. ([5]) has a very intuitive meaning. 

• when eq. ^ is considered from the stochastic point of view, the noise £ 
is regarded as a grid function belonging to the space of all possible noises 
1Z. If £ is regarded as a random variable, the probability on the sample 
space TZ can be defined in a naive way, namely every noise has the same 
probability. This is the basic idea of the Loeb measure ([9]) which is an 
important tool in the applications of NSA , but we do not need to use it. 
Actually we do not need to introduce any kind of measure. 



2 The Alpha- Calculus 

2.1 Basic notions of Alpha-Theory 

In this section, we will expose the basic facts of a-theory and the basic tools 
which will be used in the paper in a elementary and self contained way. 

a-theory is based on the existence of a new mathematical object, namely 
a which is added to the other entities of the mathematical universe. We may 
think of a as a new "ideal" natural number added to N, in a similar way as the 
imaginary unit i can be seen as a new ideal number added to the real numbers 
R. Before going to the axioms for a, we remark that all usual principles of 
mathematics are implicitly assumed. Informally, we can say that, by adopting 
a-theory, one can construct sets and functions according to the "usual" practice 
of mathematics, with no restrictions whatsoever. A precise definition of what we 
mean by "usual principles of mathematics" (i.e. of our underlying set theory) 
is given in [5]. 

Like the use of the imaginary entity i is governed by simple properties like 
i 2 = — 1 and the usual rules for the product and sum, the use of a is governed 
by the following five axioms. 

al. Extension Axiom. 

Every sequence <p can be uniquely extended to N U {a}. The corresponding 
value at a will be denoted by ip(a) and called the value of ip at the point a or 
more simply "a-value" . If two sequences (p,ip are different at all points, then 

tp{a) ^ip{a). 



We remark that if p : N — > A, then in general ip(a) ^ A. The "difference 
preserving" condition given above can be rephrased as follows: "If two sequences 
are different at all n then they must be different at the point " a" as well" . It is 
a non-triviality condition, that will allow plenty of values at infinity. Moreover 
remark that the a-value of a sequence should not be confused with its limit. In 
fact, the a-value differs from a limit even for this first axiom; in fact different 
sequences might have the same limit. 

The next axiom gives a natural coherence property with respect to compo- 
sitions. If g : A — > B and h : B — > C, denote by h o g : A — > C the composition 
of h and g, i.e. (h o g)(x) = h(g(x)). 

olI. Composition Axiom. 

If ip and ip are sequences and if f is any function such that compositions f o tp 
and f o tp make sense, then 

tp{a) = tp(a) ^>{fo ip)(a) = (/ o ^)(a) 

So, if two sequences takes the same value at infinity, by composing them with 
any function we again get sequences with the same a-value. 



ct3. Real Number Axiom. 

If c r : n t—> r is the constant sequence with value r, then c r (a) = r: if ljj : n i— ► 

n is the immersion of N in K. then ln(oc) — a (£ R. 

This axiom simply says that, for real numbers, the notions of constant se- 
quence is preserved at infinity. The latter condition says that the ideal number 
ex is actually a new number. Thus the immersion In provides a first example 
of sequence <p : N — > N such that (p(at) ^ N. 

a4. Internal Set Axiom. 

If ip is a sequence of sets, then also ip(cx) is a set and 

ip(a) — {tp{ot) : tp(n) € ip(n) for all n} . 

Thus, the membership relation is preserved at infinity. That is, if ip(n) G 
i/j(n) for all n, then (p(a) e ip{ct). Besides, all elements of ip( a ) are obtained in 
this way. That is, they all are values at infinity of sequences which are pointwise 
members of tp. The set considered above will be called Internal sets. 

a5. Pair Axiom. 

If i?(n) = {ip(n),ip(ri)} for all n, then $(a) = {(p(a),tl>(a)}. 

Thus, if the sequence £ is such that either £(n) = <p(n) or ^(n) = ip(ri) for 
all n, then either £(a) — ip(cx.) or £(a) — ip(a) at infinity as well. As a straight 
consequence of the last two axioms, any constant sequence with value a finite 
set of natural numbers, or a finite set of finite sets of natural numbers etc., takes 
the same value at infinity as well. We remark that this is not true in general. 

We remark that the above five axioms are given somewhat "informally" . 
Precise indications for a rigorous formulation as sentences of a suitable first- 
order language are given in [5]. Also, we refer to [5] for the proves of the 
propositions below, but we suggest the reader to try them by himself to get 
acquainted with a-theory. 

Definition 1 If A is a set, the ^-transform of A is defined as follows: 

A* ={<p(a) :ip:N^ A}. 

If ip is a sequence such that ip(n) = A for all n,then by the Internal Set 
Axiom, we have that ip(a) = A*. Then constant set- valued sequences behave 
differently than real valued sequences (cf. the Real Number Axiom). 

Definition 2 The set of the hyperreal numbers is the * -transform of the set of 
the real numbers: 

R* = {(p(a) : ip:E^R}. 



In other words, the hyperreal numbers are the a-values assumed by real 
sequences. With obvious notation, for instance we will write sin — to mean the 
hyperreal number obtained as the values at infinity of the sequence {sin ^} . 

The sum and product operation are naturally transported on the hyperreal 
set moreover we have the following: 

Proposition3 The hyperreal number system (R*; +, •, 0, 1, <) is an ordered 
field. 

Besides the considered sets of hyper-numbers, another fundamental notion 
in nonstandard analysis is the following. 

Definition 4 A set V C A* is called hyperfinite if 

T = {ip{a) : <p(ri) e A n } 

where A n C A is a sequence of finite sets. Given a hyperfinite set F, we define 
its cardinality \T\ as follows: 

\T\ =y;(a)eN* 

where tp (n) = \A n \ is the cardinality of the finite set A n . 

In general hyperfinite sets are infinite; their importance relies in the fact that 
they retain all "elementary" properties of finite sets. Applications of hyperfinite 
sets will be given in subsequent sections, for example the following hold 

Proposition 5 Every nonempty hyperfinite subset of M* has a greatest and a 
smallest element. 

A very important example of hyperfinite set which we will use in this paper 
is the hyperfinite grid H. The hyperfinite grid H Q is defined as the a-value of 
the set 



namely, 



l n = < - : ke Z, < k < — 

1 n 2 ~ 2 



a ' 2 ~ 2 



In the follwing, for short, usually we will write M instead of H Q . Clearly HI is 
an hyperfinite set with |H| = a 2 . Given a, b £ H, we set 

[a,b]m = {x eU:a<k<b} 
[a,b)m = {x e H : a < k < b} 



If we identify the functions with their graphs, /* is defined by definition [T] 
and it is not difficult to prove the following 



Proposition 6 Let f : A — ► B be a function. Then its star-transform f* is a 
function f* : A* — ► B* and, for every sequence ip : N — > ^4, 

r(¥»(a)) = (/o ¥> )*(a) 

Moreover, f* is 1-1 (or onto) iff f is 1-1 (or onto, respectively). 

When confusion is unlikely, we will omit the symbol "*" and "/*" will be 
denoted by"/". 

Let /„ : A — > B be a sequence of functions; then identifying the functions 
with their graphs f a is well defined by axiom (a3) and we have that 

f a ■ A* -> 5* 

is a function defined by 

f a (f(aj) = ip (a) 

where ip (n) := fn(<p(n)) is a sequence in B. 

Definition 7 /I function 

f : A* -► B* 

is called internal if it is the graph of an internal set, namely if there is a sequence 
of functions f n :A^B such that 

I = fa 

2.2 Infinitesimally small and infinitely large numbers. 

A fundamental feature of a-calculus is that the intuitive notions of " infinitesi- 
mally small" number and " infinitely large" number can be formalized as actual 
objects of the hyperreal line. This give many possibilities to simplify proofs and 
statements in calculus theory. 

Definition 8 A hyperreal number £ G R* is bounded or finite if its absolute 
value |£| < r for some r € R. We say that £ is unbounded or infinite if it is not 
bounded. £ is infinitesimal if |£| < r for all positive r £ R. 

Clearly, the inverse of an infinite number is infinitesimal and vice versa, i.e. 
the inverse of a (nonzero) infinitesimal number is infinite. An example of an 
infinitesimal is given by @ := 1/a, the a -value of the sequence {1/n}. 

From now on, the symbol © will always denote 1/a. 

All infinitesimal and all real numbers are bounded. However there are finite 
hyperreals that are neither infinitesimal nor real, for example 5+© and 7+sino:. 

Definition 9 We say that two hyperreal numbers £ and r\ are infinitesimally 
close if £ — r) is infinitesimal. In this case we write £ ~ rj. 



It is easily seen that ~ is an equivalence relation. 

On the other hand (as it is intuitive) each bounded hyperreal is infinitely 
close to some real. The following indeed comes from the completeness of the 
real line. 

Theorem 10 (Shadow Theorem) Every bounded hyperreal number £ is in- 
ftnitesimally close to a unique real number r, called the shadow of!;. Symbolically 
r = sh{£). 

The notion of a shadow is extended to every hyperreal number, by setting 
sh(t;) = +00 if £ is positive unbounded, and sh(£) — —00 if £ is negative 
unbounded. 

Definition 11 Given two hyperreal numbers £ and £ S R*\ {0} , we say that 
they have the same order i/£/C an d C/£ are bounded numbers and we will write 

(notice the difference between " ~ " and " s=s " since these symbols will be largely 
used in the rest of this paper). We say that £ has a larger order than £ if £,/Q is 
an infinite number and we will write 

e»c 

We say that £ has a smaller order than C, if £/C * s on infinitesimal number and 
we will write 

2.3 Some notions of infinitesimal calculus 

Now we see how all this machinery can be used to build a rigorous "infinitesimal" 
calculus. We present how the definition of limit can be given in our setting. 

Definition 12 We say that lim x ^ Xo f{x) = I «//*(£) ~ I for all £ ~ x (£ ^ 

x ). 

With the definition of limit all the elementary calculus can be reconstructed, 
but the features of our method allow to avoid the use of limits and work with real 
infinitesimal and infinite numbers. Let us see some example: the definition of 
continuity and derivative. We remark that the theory given by these definitions 
is equivalent to the standard calculus and all the known results (as for example 
the Lagrange's or Fermat's theorems) applies. 

Definition 13 A real function f : A — > M is continuous at xq € A if for every 

£eA*,£~x ^.r(0~/*M. 

Let / be any real function defined on a neighborhood of xq ■ 



Definition 14 We say that f has derivative at xq if there exists f'(xo) € K 
such that for all infinitesimals e^O, 

r(x + e)-r(x ) 

J (xo) 

£ 

Equivalently, f has derivative f'(xo) at xq if for every infinitesimal e there 
is an infinitesimal S such that f*(xo + e) = f(xo) + f'(xo)s + 5s. 

As said before all the classical results of calculus hold in this framework. 
An example which will be used in the following is the Taylor formula (with 
infinitesimal remainder) . 

Theorem 15 If f € C n+1 (R) then for each infinitesimal e there is an infinites- 
imal r\ such that 



r(x + e) = J2 



f(k)( x y 



k<n 



ne 



Now we introduce a concept of integral. This concept is more general than 
the Riemann integral and will allow us to integrate noises and stochastic equa- 
tions. Intuitively this integral is just an infinite sum of hyperreal numbers. This 
sum will be done on an hypcrfinite set. 

Definition 16 If T = x( a ) * s a hyperfinite set of hyperreal numbers, then its 
hyperfinite sum: 

y x = Sum x (a) 

x<=r 

is defined as the value at infinity of the sequence of finite sums 

Sum x {n) = y, x - 

xSx(n) 

It is easily checked that this definition does not depend on the choice of the 
sequence {x( n )}: but only on its value at infinity I\ Using this definition, we 
define the a-integral. 

Definition 17 Let f : A — > K be any function, where ACI. Its Alpha-integral 
on A, denoted by J A f(x) Ax, is the number in RU{±oo} defined as the shadow 
of the following hyperfinite sum: 



f f{x) Ax = ahl®- y, no) 

•' A \ {eHnA* J 



Notice that 



J f{x) Ax = sh(S A (a)) where S A (n) =- • J^ /*( 



i£H(n)ni 



Of course, if A = [a, b] is a closed interval, we adopt the usual notation 

The Alpha-integral J f(x) Air is defined for every function. In fact, while 
the sequence 



1 

n 



*) = -■ E n*) 

iei(n)n(o,i) 



may not have a limit in the classic sense, its a- value S\{a) is always defined. 
If the function / is Ricmann integrable then linin^oo S\{n) exists and coincides 
with the a- integral (notice that if a real sequence {ip(n)} has "classic" limit 
I £ RU {±00}, then it must be sh {(p(a)) = I). Thus the Alpha-integral actually 
generalizes Riemann integral. 

3 Grid functions 

A grid function is a function whose argument range on an hyperfinite "grid" 
whose elements are the (hypernatural) multiples of — . Since the grid is hyper- 
finite these functions are easy to handle and from many points of view they 
behave similarly to functions on finite sets. We will see that this simple kind 
of functions are flexible enough to contain elements representing distributions. 
This flexibility will allow us to obtain in a simple way a kind of stochastic 
calculus ( see, e.g. the Ito's formula, Thm. I2TJ) . 

3.1 Basic notions 

An internal function 

£ : H -» M* 

is called grid function. 



Definition 18 Given a grid function £ : H — > M* , we define its grid derivative 

m as 

A£ m _ £(* + ©)-£(*) 
At [) ~ ® 

The grid integral of £ is defined as 



[£]=©££(*); 



1 ei 



if r C H is a hyperfinite set we define Ip [£], its grid integral in T, as 

Ir [£] = ©££(*) 

ter 



10 



Most of the properties of the usual derivative hold also for the grid derivative, 
for example we have that, if £ and £ are continuous functions, with finite grid 
derivative, 



A(£C) 
At 



£(* + ©)C(* + ©)-£(*)£(*) _ 

© 

e(* + © )c(* + © ) - e(t + © )cw + a* + © )cw - ewcw 



- gw-cw + ^ + ©).§(^§-c + £-§. 

These notions can be easily extended to functions of more variables; for 
example if 

p(t,x) :ixI^M* 

we set 

p(t + d),x) - p(t,x) 



A~x^ x) 



p(t,x + (i)) - p(t,x) 



and if r C H 2 is a hyperfinite set we define its grid integral Ir [p] as 

Ir[p] = © 2 J2 P^ x )- 
(t,x)er 

It is clear that the derivative of a grid function £ is a grid function. Moreover, 
if £ is a grid function, then the grid integral function x i— ► \ a ,x) [£] i s a grid 
function. We have the following relation between the grid-derivative and the 
grid-integral: 



Theorem 19 If £ is a grid function, then 



"■[x,y 

A 
Ax 
Proof. Obviously we have 



Ax 

I[a,x) K] 



£(»)-£(*) 

£(*) 



A£ 



E 



$(»+©)-$(*) , ax + 2-®)-t(x + ®) 



^(y) -av-®) 



ay) - Z(x) 



Furthermore 



A^W)K1 ' @ 

= E £(*)- E £(*) = *(*)■ 

t£[a,x+©) ie[o,x) 
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Definition 20 of A grid function £ is called integrable in [a, b] if I[ a ,6] [£] * s 
finite; in this case, we set 

f £{a) As := sh (l [0)6) [£]) = sfc ( © ]T £(*) I 

£ is called absolutely integrable in [a, b] if Ir ,&) [|£|] *s finite. J £(s) (is wraZZ fee 
called a-integral o/£. 

Of course, this integral is strictly related to the a-integral given in Def. [T71 
In fact, to every real function 

/:[a,6]->R 

it is possible to associate its natural extension 

f :[a,6]*^R* 

and a grid function 

/:[a,6] H ^R* (6) 

obtained as restriction of /* to [a, b] M . When no ambiguity is possible we will 
denote /* and / with the same symbol. 

The a-integral of / coincides with the a-integral of / given by Def. [20j 

3.2 The Ito formula 

We show the power of the grid functions approach by stating in a very simple 
way a proposition which is, in some sense, a variant of the Ito's formula. As in 
the standard approach this formula will be the main tool in the study of grid 
stochastic equations. 

Theorem 21 (Nonstandard Ito's Formula) Let <p e Cq(R 2 ) and x(t) be a 
grid function such that 

^(i)|<^ 2/3 , (7) 

where r\ ~ 0. 
Then 

A Ax , , (e) , , „ / Ax '■ ' 2 



Ai <f(t,x(t)) ~ <p t (t,x(t)) + V> x (t,x(t)) — (t) + >f<p all ,(t,x(t)) ■ [-^(t) 

Here tp t , (p x and ip xx denote the usual partial derivative of ip. 
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Proof. By definition of grid derivative we have that 

A^ (f '* W) = © + © 

V>{t,x(t + ®))-v(t,x(t)) 



<p t (t, *(* + ©)) + 
<p t (t,x(t)) 



(3 
tp(t,x(t + ®))-<p(t,x(t)) 



But 



r :(/..r(/ f'3.)l=y (*>*(*)+©^f(*) 



and |© ^| (4)1 < 77a 2 / 3 • © = 77 • © 1 / 3 is infinitesimal. Then, using the Taylor 
formula (Theorem [15)) . we have that 

(p (t,x(t) + ®^(t)j = <p(t,x(Jt)) + <p a (t,x(t))'®^(f) 

1 / , 1, /„ Aa; , , N 
+ -^(t,x(t))^©— (t) 

1 / A \ ^ /A x ^ 

+ 3jV MB (*,*(*))f©-^(*)J +<5 (©AlW 

where 5 is an infinitesimal; hence 

^ + ®»-^)) = p . (t>xW) ^ w + |^ (tja(t)) .(^ (t) xa 

© 2 /Ax, A 3 r ^ 2 /Aa;. . 

By the assumption ([7]) the last two terms are infinitesimal and we get the re- 
quired result. ■ 

3.3 Distributions and grid functions 

The grid functions can be considered as a sort of generalization of the usual real 
functions. 

In fact to every real function correspond a unique grid functions given by 
(JSJ). In the traditional analysis the most important generalization of the real 
function is given by the distribution. In this section we will show that the grid 
functions represent also a generalization of the notion of distribution. 

First of all we recall some notation: given a set A C R , T> (A) denotes the 
space of C°° functions with compact support of A. The space of the distributions 
V {A) is the topological dual of I? (A) when T> (A) is equipped with the Schwartz 
topology. 
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Actually, T>' (A) can also be constructed without knowing the Schwartz 
topology by using the notion of grid function. Next, we will show how to do it. 
Let & (A) denote the set of grid function defined on 

A n := A* n U N 

On 25 (A) we define the following equivalence relation: 

Definition 22 Two grid functions ^, £ 2 are sa id t° be equivalent if 

v<pev, / (& - &) ¥«*» = o 

In this case we will write 

We may think that two grid functions are equivalent if they are macroscop- 
ically equal. 

Moreover, we set 

0o {A) =Ue0(i):V^€ V, l An [&] is finite] 

The set of distributions V (A) can be defined as follows 

V (A) = **<&. 

Thus a distribution can be considered as an equivalence class T^ of some grid 
function £ e ©o (A) . 

Tj can be identified with an element of V (A) by the following formula: 



&,?)= [ frd8 = 8h(® ■ ££(%*(*)), ^ev. 

Ja \ teA a J 



(8) 



To each distribution we can associate a grid function. For example, if T £ 
V' (R) we can do in the following way. Since a distribution T has the following 

representatioro 

00 

T = Y,D k f k 

k=a 

where fk are continuous. Then the grid function £ corresponding to T is given 

by 

- A k 

fc=0 

Let us see some simple example. The function 

5(t) = ad ,t 



See Rudin, functional analysis, Th. 6.28, pag.169 
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where 8ij is the Kronecker symbol correspond to the Dirac S. But also the 
following grid functions 



8o,t + $A,t 

a- 



53^0,t+i (fee Z); etc 



2 
correspond to the Dirac 5. The grid function 

— (t) = a 2 (<5 ,t ~ <% , t ) 

correspond to 5 . 

The grid function a 2 <5o,t is not in ©o 0*4.) and hence it does not correpond to 
any distribution. 

4 Stochastic differential equations 

4.1 Grid differential equations 

A grid ordinary differential equation is a differential equation whose time step 
ranges on the hyperfinite grid. This fact makes it to work as a discrete time 
object simplifying many formal aspects. 

A grid ordinary differential equation is then an equation of the kind 

At 

— (t) = f(t,x(t)), (9) 

where t £ MI , x(t) is a grid function and / : H x R* — > R* is an internal function. 
A grid function x(t) is a solution of the grid equation if satisfies it at each point 
of the grid. 

The following result shows that such an equation has an unique solution. 
This, without regularity assumptions on the /. Hence, this kind of equations 
has solutions even if the equations contain a noise term (see section (|4.2)l ). 

Theorem 23 Given an initial time to £ H and an initial data xq £ M* ; the 
Cauchy problem associated to (ty), that is 

£f(i) = f(t,x(t)) t£W 

a; (to) = xo 

admits for t > t Q an unique solution x : H — > M* . 

Proof. We know that / is an ideal value of a sequence {/ n }neN- Also, we 
have that to — io, Q , xq — xq, u are the ideal values associated to {£o,n}neN, {xo,n}n& 
For each n £ N and m £ Z, we can construct by induction a sequence of func- 
tions. 

x n :-Z->E 
n 
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as follows: 








$n \^0,n) 


= a;o,n 


$n 


[to,n+ n ) - 


/ m 

= X n to.n H 

V n 

+ — fn ( \to,n 



(11) 

+ (12) 

-),x n (to,„ + -)). (13) 

Then by definition of internal function we have that, for t = to + m ■ © , m £ Z* 

£ a (t ) = zo (14) 

.t q (< + ©) = x a (t) + ©f a (t,x a (t)). (15) 

Thus x ~ x a solves ([TU| . It is easy to check that this solution is also unique. 



Given xq ~ xi, it may happen that 

x(t,Xo) ^ x(t,Xi) 

where x(t, Xi) is the solution of (jTUJ) with initial data Xj. Some times we would 
like to have 

x(t, xo) ~ x(t, x{) Vxi ~ xo; (16) 

this can be useful, for example, when we want to consider the standard part of 
a hyperreal differential equation. 

We have the following proposition : 

Proposition 24 Consider the following Cauchy problem 

' £f(t) = /(*,»(*)) feB; 
x(i ) = £o, 

and suppose that, there exists L s.t. 

\f(t,x)-f(t,y)\<L\x-y\. (18) 

Let x\ be a bounded initial data for the problem J_?7| ) and x(t,X\) is the 
solution of this problem. Then , if Xi ~ Xq , then for all < t < T we have 

x(t, Xq) ~ x(t, X\) (19) 

Proof. Arguing as in standard analysis, we have that (fTS]) guarantees that, 
for any T± < X,the solution is bounded. Moreover, in standard analysis, the 
condition (|18| guarantees, the continuous dependence of the solution from initial 
data xq. In our case, until the solution is finite, we can proceed in the same 
way to prove that, chosen an arbitrary T\ < X 

\x(t,x a )-x{t,x 1 )\ < Ixo-x^e^ 1 (20) 

for all t £ [0, Xi]. Because xo ~ x\ by hypothesis, we have that 

x(t, Xo) ~ x(t, Xi) (21) 

for all t £ [0, Xi]. This assures the proof. ■ 
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4.2 Stochastic grid equations and the Fokker-Plank equa- 
tion 

In our approach, a stochastic differential equation consists of a set of grid dif- 
ferential equations. Each differential equation has a noise term and gives a 
trajectory which can be considered as a realization of a process. 

Let 1Z C <5 [0, 1] be an hypcrfinitc set of grid functions and consider the class 
of Cauchy problems 

%{t) = f(t,x) + h{t,x)£, 

x(0)=x , (22) 

m e n. 

where 

/,fc:[0,l] H xR*-R* 

We want to study the statistical behavior of the set of solutions of the above 
Cauchy problems 

S = { Xi (t):ZeK}; 

More precisely we want to describe the behavior of the density function 

p : [0, 1] H x H -» Q* 

defined as follows 

\{x e e S : x < x i (t) < x + Q)}\ 



p (t, x) 



(e)\K\ 



We are interested in the case in which 1Z models a white noise; roughly speak- 
ing we can define a white noise as the hypcrfinitc set of all the grid functions 
with values ±y/a. Here there is its precise definition: 

Definition 25 The white noise is the set of grid functions defined by 

K = K a 

where 

7^ = {-V^+V^} [Mh " 

Hence 1Z is a hypcrfinite set with \R\ = 2 a+1 . 
Remark 26 We would be tempted to write 



TZ a = {— \/a, +Va} 



[0,11b 



however this notation is very ambiguous; in fact lZ a is a set defined by the 
Internal Set Axiom and it contains only internal function. However the symbol 
{—^/a 1 +^fa] ' Q usually represents the set of all the functions f : [0, 1] H — > 
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Now, we can state the main result of this paper: 

Theorem 27 Assume that 1Z is a white noise and that f(t,x) and h(t,x) are 
continuous functions. Then the distribution T p relative to the density function 
p is a measure and satisfies the Fokker-Plank equation 

f + i;W,zK)-\£(Ht,*)%)=o. (23) 

T„(0,z) = 6 (24) 

in the sense of distribution. 

Remark 28 We recall that \2S\) and \2I$ "in the sense of distributions" mean 
that T p satisfies the equation 

Vt + f^. + lh 2 ^ xxl T p ^+v(x )=0 (25) 

for any if S T> ([0, 1) xl). The duality (■, •) is between the space of continuous 
function and the space of measures. Equation Ii25\) can be expressed using the 
grid function p and the a-integral by the following equation: 

^ £P([0,l)xl), J J (tp t + f<p x + Vxx h 2 ) P AxAt + <p(0, x ) = (26) 

Actually, we will prove Th. \27\ just proving the above equation. 

Remark 29 If f(t,x) and h(t,x) are smooth functions, by standard results in 
PDE, we know that, for t > 0, the distribution T p coincides with a smooth 
function u(t,x). Then, for any t > 0, p defines a smooth function u by the 
formula 

y<p e V ((0, 1) x R) , / / pip Ax At = utp dx dt 



and u satisfies the Fokker-Plank equation in (0, 1) X R. in the usual sense. 



Remark 30 We will see in the proof of Th. \21ty that if the functions f{t,x) 
and h(t,x) are not continuous, but only bounded on compact sets, the equation 
i26\) still holds. However in this case, equation \26\) cannot be interpreted so 
easily. For example, if f(t,x) and h(t,x) are not measurable, there is no simple 
standard interpretation. 



Given t € [0, 1] H we set 

■R.[0,t)=n a [0,t); ft„ :={-v^,+\M [ °' t)H "; (27) 
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namely, 1Z[0,t) is the set of the restrictions of the functions of TZ to [0,£)h- 
Moreover, for r <G 7?. [0, s) , we set 

fcr M] = {?£K:( (t) = t (£) for £ < s} 
So we have the following decomposition: 

K= (J ft T [s,l]- (28) 

t£K[0,s) 

We define the mean value of a grid function in the set [x, y] (~1 H as follows: 

vy y vy y te[x,i/)nH 

In general, if T is a hypcrfinitc set and $ : T — ► R* is an internal function, 
the mean value of $ in T is defined as follows: 



W*] = irE*(0 



Proposition 31 If TZ is a white noise, then for any t € [0, 1] H , and r <G TZ [0, t] , 
we have 

the hyperfinite number \TZ T [t, 1]| does not depend on r £ TZ [0, t] (29) 

and 

E £ e7MM] K(*)] ~ (30) 

%7MM] [^(t) 2 ] ~ a, (31) 

Proof. The proof is almost immediate: first of all we have that 

|ft r [i,l]|=2 a(1 -* )+1 ; 



moreover 



1 L ' Jl ?eTC T [t,i] 






2|^ r [£,l]| v 2|ft T [t,l]| 

and 



Wm [^) 2 ] = i^pji E £(*) 



2 - a 



{eRr[t,i] 
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Remark 32 The conclusion of Th. YS^hold not only if the "stochastic class" 1Z 
is defined by {27}) , but for any class 1Z which satisfies the properties A29\), I130\) 
and \31)) . For example we can take 

K = K a ; K n :={ qi Vn-,.... iqk V^} l0 ' 1]u "; keN 
with q t e M.*, 

k k 

^^ = 0; 53^ = 1. 

i=l i=l 

The following two lemmas are a direct consequence of properties (|29|) , (|30l) 
and (|3Tj) . 

Lemma 33 Let G : [0, 1] H xM'xM'^l' be any internal function. Then, for 
every t € [© , 1] 

E ielz [G(t, Xi (t) , £ (*))] = E TenQ , t) E ieKT[L1] [G(t, Xi (t) , £ (*))] 

Proof. By (gSJ, we have that 

ft = U n T [t, 1] 

TS"R.[0,t) 

Then, 

Eenw^um - lllT [t,ilmo,t)\ E E o^w.ew) 

1 L ' Jl ' L ' yl reTC[0,t)?eK T [t,l] 

- ^ Era E <*(*.*«(*),*(*)) 

1 L ' n reTZ[o,t] ' L ' Jl £ew T [*,i] 
= E reK[0 , t) E ?e7MM] [G(t, x c (t) , £ (*))] . 



Lemma 34 Let JF : [0, l] H xR* — > R* oe an internal function such that \F(t, x)\ < 
M, M e M. Tften, /or every £ € [0, 1] 

E €eW [F(i, a* (*))■£(*)] ~ 0. 
E CeW [>(t,a*(i))-£(t) 2 ] - a-E Ce7i [F(t,x e (i))] 

Proof. By lemma l33l we have that 

Ec en [F(t, Xi (t)) ^(t)] =E renm E ienAtA] [F(t,x s (t))-m] 

Since x^(t) does not depend on £(s) for s > i, we have that 

= F(t,a*(t))-Ee 6Wr [t,i]K(*)] 
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Then since F is bounded, by ([50]) , we get the conclusion: 

E£ G7 i[F(t,a*(t))£(i)] =E TeW[0 ,t) [F(t,^(t))-E eeKr[t]1] [£(*)]] ~ 
Analogously, we have that 

E ?eK [F(i,x e (t))^(i) 2 ] =E reK[0)t) [F(i,^(i))E ?G7MM] [£(£) 2 ]] 
and by (|3"Tj) . we get that 

F(t, Xi (t))E ienAL1] [t;(t) 2 ] = F{t,x i (t))(a + e T ) 
where e T ~ 0. Then 

E CeK [F(i,x ? (i))e(t) 2 ] = aE TeR | 0!t) [F(t,a: £ (t))] +E TeW[0>t) [F(t,a; £ (t))e r ] 

- aE Te7 j [0jt) [F(t,ar e (i))] 
This concludes the proof. ■ 

Now we see a basic property of the density function. 

Lemma 35 Let ip £ T> ([0, 1] x R) and Zei x^(t), £ £ 11, be the family of solu- 
tions of a grid stochastic ODE. Then 

E^e-R [<p(t, X£(t))] ~ © £ ^' ^M*' x ">- 



xen 



In particular, 



E^ n [ip(t,xz(t))] ~ / <p(t,x)p(t,x)Ax 
Proof. We have 



£erc 



xeH 



^E^^w)4E 



i^ 



CeK 



igi 



x<a;^(i)<a;+© 



53 (^(t,x)+r? e (t)) 

x<x^(t)<x-\-(e) 



where TjAt) '.— (p(t^x^(t)) — (p(t,x) is infinitesimal. Hence 



and so 
E ee7 ^(t,^(t))] = p^E 



x<x e (t)<a;+© 



x6H 



— 53 [ip{t, x) ■ © \n\ P (t, x )] = © 53 be*, »)/(.(*, x)] 



irc 



1-61 



,-rel 
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Now we can prove Theorem 1271 

Proof. Chosen an arbitrary ip £ C([0, l]xl) bounded in the second variable, 
we have that 



p(l,aj e (l))-p(0,ao)=© Yl 



te[o,i-©] H 



Atp 
~At 



(*,x c (t)), 



Now we assume that <p £ 2?([0, 1) x R)). Since x% solves eq. (|22l) . we can apply 
Theorem |2"T1 and we obtain 



<p(p,x ) ~ (§) Yl 

te[o,i) 



Ax 



Ax 



© E 

te[o,i) 



^ + ^' At + T^'VAt 



^ + ^-(/ + ^) + Y^'(/ + ^) 2 



(32) 



te[o,i) H 

Now we want to compute E^ e u of each piece of the right hand side of the 
above equation: by lemma |31>I 



E ££TC [ft + ffx\ = © E ^t + ffa 



(33) 



xeH 



for every t £ [0, 1] . 

Let us consider the second piece: {<p x h + © f xx f) £■ If we se t 

F(t,x s (t)) = f a; (t,x s (t))h(t,x ( {t)) + ®<p xx (t,xt{t))f{t,xs(t)) 

it turns out that F(t,x^(t)) is bounded. In fact if x^(t) is bounded, ip x (t,x^(t)), 
h(t,x^(t)), ip xx (t,x^(t)) and f(t,x^(t)) are bounded since they are standard 
functions; if X^(t) is unbounded, ip x (t, x^(t)) — 0. Then we can apply lemma I3"4l 
and we get that 

Ee[W* + ©¥' XB /)£]~0. (34) 



Moreover 



since f^J~0. 



© - 

-^fxxl 



(35) 



Finally, let us see the last term: &-ip xx h 2 t; 2 . We can see that ^-<p xx h 2 
is bounded (actually, it is infinitesimal); then we can apply lemma [Ml with 



F(t,X S (t)) = ®.ip xx (t,xt(t))h(t,xz(t)) 2 : 



22 



Ef 



'® *,2 t 2~ 


~ aE c 


'© ,2" 
-jVxxh 




= 1* [****>*] 




l V^„ u 



2K 



(36) 



?£R 



Then, by lemma [ 



Ec 



2/-2 



Then, by (02 
-V?(0,a;o) 



2 

.,(06 

E 



V?^ £ 



X)^aK<:(*' a; )M*.*) 2 P(*> a; ) 



i£K 



t, ? [-V3(0,X )] 

te[o,i) H 



© E ( E ? 

*6[0,1) H 



-Vxx/ 



■E F 



u2t-2 



© 2 J2 \Y.^ + f^x)p+v xx h 2 p\ 

te[o,i)„ \xem ) 

(<Pt + f<Px + <Pxxh 2 )pdxdt 



Since the first and the last term of this equation are standard, we get eq. 



JM}. 



5 Conclusion 



In this section we will make some comments suggested by the results of this 
paper. 

The first comment is that, 

• in many applications of Nonstandard Analysis, only elementary facts and 
techniques of nonstandard calculus seem to be necessary. 

In fact a-theory is much simpler than the usual Nonstandard Analysis, but 
it seems absolutely adequate to treat the kind of problems considered here. 



_>:-! 



However, if you compare this paper with [2J, [5] and [TT], the reason why 
this paper is much simpler lies in the fact that we have not made a nonstandard 
theory of the stochastic differential equation, but rather, we have replaced them 
with the stochastic grid equations which are much simpler mathematical objects; 
in this contest the Ito integral is replaced by the a-integral and the proof of the 
key point of this theory, the Ito formula, reduces to an exercise. 

The reason why a much simpler object can represent the processes of diffu- 
sion at microscopic level is that we have taken the infinitesimals seriously and 
we have used them to model an aspect of the " physical reality" . We think that 
in general, 

• the advantages of a theory which includes infinitesimals rely more on the 
possibility of making new models rather than in the demonstration tech- 
niques. 

In the case considered in this paper, the way the model has been constructed 
appears quite natural: the stochastic grid equations, which need the notion of 
infinitesimals, describe the diffusion processes at microscopic level; the Fokker- 
Plank equation describes the diffusion processes at the macroscopic level and 
uses standard differential equations (and the theory of distributions when the 
data are not regular). 

The connection of these two levels is given by eq. §8§ which relates grid 
function (microscopic level) with distributions (macroscopic level). 

A final remark concerns the theory of probability. We have not used the 
notion of probability to show that every thing can be kept to a very elementary 
level and that our variant of the Ito formula makes sense also in a context where 
probability does not appear. 

However probability can be introduced in a very elementary way. We may 
think of the stochastic class 11 as a sample space. The events are the hyperfinite 
sets £c? (T^-)* and the probability P of an event is given by 

\E\ 
nE) = ^ y (37) 

For example E might be the event that at a time to , the moving particle lies 
in the interval [a, b] , namely 

E = {x^ E S : X£ (t ) € [a, b] u } 

In this case we have that 

b 



P(E)~ / p(t,x) Ax 

J a 



Obviously, this is the most natural extension to infinite sample spaces of 
the classical definition of probability. There is no doubt that definition ([37]) . 
is the most simple and intuitive definition of probability. The price which you 
pay is that P takes its values in Q* and not in R; thus, a probability space 
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(1Z, P) is an internal object and, if it is infinite, it is not standard. The problem 
arise if you want to connect the nonstandard world with the standard one. This 
operation can be done in a very elegant way via the Loeb integral (see [9i or also 
[8 ). A different way to make easy the connection between this two worlds has 
been proposed by Nelson [TT]. However, we think that, in most cases, it is not 
necessary to make this connection. Only at the very end you may consider the 
shadow of the numbers which you have obtained. At least this can be done in 
the theory which we have exposed in this paper. So, if we accept a mathematical 
description in which the infinitesimal exist, 

• the probability of an event is given by a hyperrational number, 

and many theorems become simpler. This scheme avoids also some facts in 
probability theory which are in contrast with the common sense: for example 
the fact that the union of impossible events gives a possible one (the probability 
of a non-denumerable set might be non-null even if the probability of each 
singleton is null). 

Concluding the final remark of this paper could be the following one: 

• the infinitesimals should be taken seriously. 
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